† Corresponding author. E-mail:
Three-dimensional (3D) reconstruction of icosahedral viruses has played a crucial role in the development of cryo-electron microscopy single-particle reconstruction, with many cryo-electron microscopy techniques first established for structural studies of icosahedral viruses, owing to their high symmetry and large mass. This review summarizes the computational methods for icosahedral and symmetry-mismatch reconstruction of viruses, as well as the likely challenges and bottlenecks in virus reconstruction, such as symmetry mismatch reconstruction, contrast transformation function (CTF) correction, and particle distortion.
With the development of electron microscopy, algorithms, software, computing power, and particularly, the direct electron detector (DED) camera in recent years, single-particle cryo-electron microscopy (cryo-EM) has become one of the most powerful tools in structural biology.[1,2] Icosahedral viruses have played a critical role in the development of cryo-EM; many new cryo-EM techniques were first established for structural studies of viruses. The main reasons behind the relevance of icosahedral viruses for cryo-EM development are (i) their high symmetry, which is equivalent to a 60-fold increase in data set size, and, hence, more effective noise suppression and improved reconstruction resolution; (ii) their large molecular mass, which can produce a strong image contrast allowing for more accurate determination of particle orientation parameters (three Euler angles, θ,
In the 1970 s, through 3D structural computations of the tomato bushy stunt virus, Crowther et al. established the reconstruction theories of icosahedral viruses, including the common-line method used to determine the orientation and center parameters of particles, and the Fourier–Bessel synthesis method used to calculate the 3D structures from two-dimensional (2D) images.[4,5] In 1984, the first high-contrast cryo-EM micrograph was obtained using adenovirus embedded in vitreous ice through plunge freezing,[6] which enabled preserving high-resolution information of macromolecular complexes and expedited high-resolution reconstruction. In 1997, two groups from the Medical Research Council Laboratory and National Institutes of Health,[7,8] respectively, resolved the hepatitis B capsid structure to a sub-nanometer resolution, identifying, for the first time, the secondary structure of capsid proteins from a 3D cryo-EM; their results represented a milestone in cryo-EM and an advancement toward the structural biologistʼs dream of solving the atomic structure of macromolecules without requiring crystallization.[9] In 2008, several icosahedral virus capsids were resolved to near-atomic resolution (approximately 4 Å) by single-particle cryo-EM,[10–13] enabling the development of backbone models based on cryo-EM maps. In 2010, several full atomic models of icosahedral viruses were constructed de novo by cryo-EM utilizing well-resolved side chain densities.[14–16] More recently, determining structures at a near-atomic resolution has become increasingly routine, and many icosahedral viruses have been resolved to a resolution of approximately 3 Å.[17–21]
Many reviews have been published on structural analyses of icosahedral viruses using cryo-EM.[22–24] In this review, we focus only on the methodology for icosahedral and symmetry-mismatch reconstruction of viruses by cryo-EM and the possible challenges.
Figure
The determination of the orientation and center parameters begins with an initial 3D model, which is generated from either a low-resolution negative-stain 3D reconstruction, an appropriately filtered x-ray model, an EM map of a homolog,[25] or even a random sphere. The accuracy of the orientation and center parameters of each particle image directly determines the resolution of the reconstruction. Two approaches are used to determine these parameters, which are described in the two subsequent sections.
In Fig.
To accelerate computation, the orientation and center alignment is usually performed in two steps: 2D classification (assigning particles with the same view to one class) and 3D refinement (locally optimizing around given parameters). The first step, 2D classification, is used to determine the three in-plane parameters (x, y, ω); subsequently, all particle images within the same class are averaged to improve the signal-to-noise ratio (SNR); and the second step, 3D refinement, is used to simultaneously determine the two out-of-plane parameters (θ, ϕ) and refine the three in-plane parameters. Two approaches are used to perform 2D classification. The first approach, which is implemented in EMAN[26,27] and Relion,[28] does not rely on the initial references. It determines the optimal rotational angle in-plane, then refines the center with the rotational angle, and finally uses the optimal in-plane parameters for classification. The second approach, which is implemented in Frealign,[29] first performs a rough calculation of the center, then searches the rotational relationship between raw particle and template with assigned parameters from 0° to 90°. Angles in the other quadrants, 90° to 360°, are calculated at the same time by rotating the particles 90° without interpolation.
An additional approach is polar Fourier transform (PFT), which uses an alternative sequence to assign the orientation and center parameters.[30] Each raw particle image and all projections are first resampled in the polar coordinates, and the initial particle center is determined using the cross-correlation coefficient between the raw particle image and the average image for all projections. Subsequently, the in-plane rotational angle ω is determined using the best rotational correlation between the image and all projections. Finally, an accurate particle center is calculated using the cross-correlation between the particle images rotated with ω and the optimal projection.
In cryo-EM, the 3D structure can be described as its Coulomb potential function ρ (r), which can be calculated using Fourier inversion form structure factors F(X, Y, Z). In Cartesian coordinates, the expression is written as follows:
Besides the two general methods, projection matching and PFT, another method can be used to determine the particle orientation and center of icosahedral viruses. In the 1970 s, Crowther et al. first proposed the common-line method[4,5] to determine the orientation and center parameters of EM particle images, and the Fourier–Bessel synthesis method to calculate a 3D density map. They used these methods to reconstruct two icosahedral viruses: the tomato bushy stunt virus and human wart virus.[5] In Fig.
For an individual virus particle image, an approximate center can be determined using a center-of-mass calculation or a cross-correlation calculation between the particle image and itself rotated by 180° (self-correlation); subsequently, the possible orientation can be exhaustively searched in an asymmetric unit of the icosahedron using the minimal phase residual (the difference in the phase angle of the two points on the pair of common-lines with the same resolution) between self-common line pairs. Because self-common lines are produced by the inherent symmetry of the particle, this method can be used to generate an initial icosahedral model, which is highly effective for viruses with nonsmoothed capsids, such as adenovirus[16,32,33] and cypovirus.[34]
Once an initial 3D model is constructed, several 2D projections with known orientations in an asymmetric unit of an icosahedron can be generated as references. Subsequently, for each particle image, the minimal phase residual of the cross-common lines between the particle image and each of the references can be searched exhaustively to determine the possible orientation and center parameters. Because the cross-common lines are highly sensitive to variations of the particle orientation and center, the search for the orientation and center parameters can converge quickly. Using a similar method, based on the convergence of the orientation and center parameters, more accurate orientation and center parameters can then be obtained by local refinement around the known parameters.
Compared to the different methods for determination of particle orientation and center, the goal of the projection matching method is to achieve convergence quickly to determine a particleʼs orientation and center parameters by globally searching for all possibilities through 3D refinement. To reduce the computational cost, local refinements can be performed instead, which can be started from several parameter tuples that have demonstrated better correlation coefficients in the global search. This is called a heuristics search. Subsequently, the search can be optimized by performing local refinements near the best point. However, heuristics searches may result in a local optimum instead of a global optimum, which could be achieved by 3D refinement.. The search for the orientation and center parameters in PFT is divided into three independent steps, and, because the angle calculation is performed in one-dimensional space, the computation speed is high; however, for the same reason, the precision of the calculation is limited. Owing to its fast convergence, the common-line method is widely used to resolve the 3D structures of icosahedral viruses (Table
In addition to the direct Fourier transformation method, there are two other methods for icosahedral reconstruction, Fourier–Bessel synthesis and Fourier-spherical-Bessel synthesis.
In cylindrical coordinates, the 3D density map can be expanded as
In spherical coordinates, the Fourier inversion can be written as
In theory, interpolation in various reconstruction methods could easily be done using the direct Fourier transform method, but the uneven distribution of the 2D Fourier transformations may lead to different accuracies for different interpolated lattice points in Fourier space. In the Fourier–Bessel synthesis, the calculation is fast thanks to the one-dimensional interpolation in the Fourier space. However, the basis functions used to interpolate the 3D Fourier space only have a five-fold symmetry, and not an icosahedral symmetry, which leads to a decline in accuracy; also, uneven data distribution in cylindrical coordinates leads to data loss. Compared with the other two methods, interpolation in the Fourier-spherical-Bessel synthesis is performed on each 2D spherical shell in the 3D Fourier space; this results in much more of the available data to be included in the step of linear-squares fitting. Moreover, the symmetry of the ISAFs is identical with that of the reconstructed object; therefore, the Fourier-spherical-Bessel synthesis method can improve the accuracy of interpolation and significantly reduce the influence of noise. Because the basis functions become increasingly complex as the desired resolution is improved, this method increases computing costs and likely increases calculation errors.
Symmetry mismatch is ubiquitous in virus structural organization. For example, adenoviruses comprise 12 three-fold fibers projecting from the 12 five-fold vertices of their icosahedral capsid, and many bacteriophages consist of an icosahedral head and a long tail on one unique vertex; most spherical viruses consist of a nonicosahedral genome encapsidated within an icosahedral capsid. Reconstruction in 3D of an icosahedral capsid presents the great advantage of a 60-fold symmetry, which can be used to significantly improve the reconstruction resolution. However, the 60-fold symmetry becomes a major disadvantage in symmetry-mismatch reconstruction because all nonicosahedral features are smoothed upon imposition of the icosahedral symmetry. This explains why, despite nearly five decades of extensive study of capsid structures of icosahedral viruses, there are few structural studies of the viral genomes within the capsids.
To explain the symmetry-mismatch structure, a 2D symmetry-mismatch object is used as an example: a nonequilateral triangle encapsidated by a square. When the squares are lined up (crystallized), the triangles become random (Fig.
Three major methods are available to study the symmetry-mismatch structures of icosahedral viruses. The first is based on icosahedral reconstruction; its main process comprises three steps: (i) icosahedral reconstruction is performed to obtain a 3D density map as an initial model; (ii) the icosahedral symmetry is relaxed to a C1 symmetry and the orientation of each particle image is exhaustively searched; and (iii) asymmetric reconstruction is performed without any imposed symmetry. Some bacteriophage tail structures have been obtained using this method; however, the resolution of asymmetric reconstructions tends to be much lower than that of icosahedral reconstructions; possibly because the icosahedral information in particle images causes errors when determining the asymmetric orientation. The second method is based on localized reconstruction from raw particle images; its main process comprises three steps: (i) the icosahedral orientation and center of each particle image are determined using a conventional icosahedral data processing method; (ii) the positions of 12 vertices of the icosahedron are calculated according to the known icosahedral orientation and center, and the vertices images from each 2D raw particle image are boxed; and (iii) these new images are classified and reconstructed without any imposed symmetry to obtain the asymmetric structures. Briggs et al. and Morais et al. used this method to reconstruct the nonicosahedral structures of a Kelp fly virus vertex[40] and a bacteriophage tail;[41] IIca et al. used a similar method to reconstruct the VP4 spike structure in rotavirus.[42] These authors did not obtain high-resolution structures of nonsymmetric regions of the viruses, which is probably due to bias in the nonsymmetric structural information when aligning the overlapping symmetric structural information in raw cryo-EM images.
In our study conducted in 2015, we presented a new symmetry-mismatch reconstruction method based on 2D particle image intensity extraction. Using this method, we resolved the complete structure of the genome within an icosahedral capsid, together with its associated polymerases to a near-atomic resolution. Our symmetry-mismatch reconstruction method, which took advantage of the near-atomic resolution density map of the icosahedral capsid and of the calculated icosahedral orientation and center parameters for all raw particle images, comprised three modules: nonicosahedral information extraction, initial asymmetry model generation, and symmetry-mismatch orientation determination and reconstruction. Figure
According to WPOA, regardless of CTF convolution, the intensity of a cryo-EM microscopy image can be written as follows:[43]
For each raw particle imageʼs icosahedral orientation, we randomly selected one of the 60 equivalent orientations as an initial orientation. Subsequently, we reconstructed a full virus structure (including the icosahedral and nonicosahedral components) without any symmetry imposed using raw particle images. Finally, we obtained an initial model of the nonicosahedral component by masking icosahedral capsid densities.
For each new nonicosahedral image produced in the nonicosahedral information extraction step, we projected the model of the nonicosahedral component to generate 60 projection images according to the 60 equivalent orientations of the icosahedron. Subsequently, we searched for the projection that best matched with the nonicosahedral image (the highest cross-correlation coefficient) and assigned the corresponding orientation to the new image for further analysis. Next, we reconstructed a full virus structure without any imposed symmetry using the raw particle images with their newly assigned orientations, improving the model of the nonicosahedral structure by masking the capsid structure. Finally, the improved model was used as a new reference to perform iterations of the symmetry-mismatch orientation search and asymmetry reconstruction until the orientations of individual nonicosahedral images stabilized and no further improvements to the nonicosahedral structure could be obtained.
Compared with the other two methods for symmetry-mismatch reconstruction, the major advantage of our method is that we use the extracted images to determine the symmetry-mismatch orientation and the raw particle image to reconstruct the asymmetric structure. Therefore, we avoid high-symmetry bias when determining the asymmetric orientation, and we push the resolution of the full symmetry-mismatch of the icosahedral structure to a near-atomic level.
With the development of DED cameras, the resolution of cryo-EM reconstructions for low or nonsymmetric complexes has experienced a revolution; the highest resolution so far was 1.8 Å.[1] However, determining the near-atomic structures of icosahedral viruses was already routine before the arrival of DED cameras, and essential improvements in icosahedral reconstruction have not occurred since, particularly for large viruses. Except for the change in the research focus of cryo-EM, the key reason for this is that the 3D reconstruction of icosahedral viruses suffers from methodological challenges.
To improve the reconstruction resolution of viruses, the capsid is always considered to be perfect icosahedron. However, the capsid is not, in fact, a true spherical shell; some geometrical local distortions must occur among the 20 triangular facets of an icosahedron. Moreover, tests have indicated that electron microscopes suffer from elliptical distortion of 2%–3%.[44,45] Distortions can hinder further improvement of the resolution, particularly for large particles, since distortions tend to become increasingly severe as the diameter of particles increases.
CTF appears as a vibrating curve in the Fourier space of a cryo-EM image, specifically in the high-frequency part; a small error in the defocus value could lead to a complete phase reversal. Currently available methods for CTF correction assume a constant defocus value for an entire micrograph or particle, while, in fact, a focus gradient exists within a virus particle, and the difference in defocus values between the top and the bottom of the particle is equal to the diameter of a virus particle. Overcoming this problem using current reconstruction algorithms is difficult.
All icosahedral viruses contain a symmetry-mismatch core, composed of a genome and its associated proteins, encapsidated within a capsid shell. Little structural information is currently known about the core. Our symmetry-mismatch reconstruction method is only the beginning; to resolve the atomic structure of the inner cores of viruses, more robust algorithms need to be developed.
[1] | |
[2] | |
[3] | |
[4] | |
[5] | |
[6] | |
[7] | |
[8] | |
[9] | |
[10] | |
[11] | |
[12] | |
[13] | |
[14] | |
[15] | |
[16] | |
[17] | |
[18] | |
[19] | |
[20] | |
[21] | |
[22] | |
[23] | |
[24] | |
[25] | |
[26] | |
[27] | |
[28] | |
[29] | |
[30] | |
[31] | |
[32] | |
[33] | |
[34] | |
[35] | |
[36] | |
[37] | |
[38] | |
[39] | |
[40] | |
[41] | |
[42] | |
[43] | |
[44] | |
[45] | |
[46] | |
[47] | |
[48] | |
[49] | |
[50] | |
[51] | |
[52] | |
[53] | |
[54] |